AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Multimodal Large Language Model Visual Backbone

# Multimodal Large Language Model Visual Backbone

Mlcd Vit Large Patch14 336
Apache-2.0
A visual feature extraction model based on ViT-L/14@336px architecture, surpassing CLIP benchmarks in multiple multimodal tasks
Multimodal Fusion Safetensors
M
DeepGlint-AI
1,450
10
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase